Cuneiform Detection in Vectorized Raster Images

نویسندگان

  • Judith Massa
  • Bartosz Bogacz
  • Susanne Krömker
  • Hubert Mara
چکیده

Documents written in cuneiform script are one of the largest sources about ancient history. The script is written by imprinting wedges (Latin: cunei) into clay tablets and was used for almost four millennia. This three-dimensional script is typically transcribed by hand with ink on paper. These transcriptions are available in large quantities as raster graphics by online sources like the Cuneiform Database Library Initative (CDLI). Within this article we present an approach to extract Scalable Vector Graphics (SVG) in 2D from raster images as we previously did from 3D models. This enlarges our basis of data sets for tasks like word-spotting. In the first step of vectorizing the raster images we extract smooth outlines and a minimal graph representation of sets of wedges, i.e., main components of cuneiform characters. Then we discretize these outlines followed by a Delaunay triangulation to extract skeletons of sets of connected wedges. To separate the sets into single wedges we experimented with different conflict resolution strategies and candidate pruning. A thorough evaluation of our methods and its parameters on real word data shows that the wedges are extracted with a true positive rate of 0.98. At the same time the false positive rate is 0.2, which requires future extension by using statistics about geometric configurations of wedge sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multidirectional Scanning Model, MUSCLE, to Vectorize Raster Images with Straight Lines

This paper presents a new model, MUSCLE (Multidirectional Scanning for Line Extraction), for automatic vectorization of raster images with straight lines. The algorithm of the model implements the line thinning and the simple neighborhood methods to perform vectorization. The model allows users to define specified criteria which are crucial for acquiring the vectorization process. In this model...

متن کامل

Automatic Road Extraction from High Resolution Satellite Images Using Neural Networks, Texture Analysis, Fuzzy Clustering and Genetic Algorithms

In this article, a new method for road extraction from high resolution Quick Bird and IKONOS pan-sharpened satellite images is presented. The proposed methodology consists of two separate stages of road detection and road vectorization. Neural networks are applied on high resolution IKONOS and Quick-Bird images for road detection. This paper has endeavoured to optimize neural networks’ function...

متن کامل

Self extracting SVG Rendering of Compressed Raster Images Vectorized by DDT Triangulation

This paper presents a portable compression algorithm applied to raster data images properly triangulated by using DDT (Data Dependent Triangulation). In particular the input source data are encoded to be rendered by standard SVG (Scalable Vector Graphics) engine. The proposed compression strategy works reducing the overall entropy implementing some heuristic strategies to properly re-code the r...

متن کامل

Compressed SVG Representation of Raster Images Vectorized by DDT Triangulation

This paper presents a portable compression algorithm applied to raster data images properly triangulated by using DDT (Data Dependent Triangulation). In particular the input source data are encoded to be rendered by standard SVG (Scalable Vector Graphics) engine. The proposed compression strategy works reducing the overall entropy implementing some heuristic strategies to properly re-code the r...

متن کامل

The Usability of Vectorization and a New Point Matching Procedure as First Step in Conflating Raster and Vector Maps

The growing fields of GIS application (local administration, tourism, archaeology, geology, ...) has increased the interest in studying the information sharing from different geographic databases, also known as “GIS data interoperability”. This is a huge problem, involving several aspects. Among them, we decided to concentrate an a geometric conflation of different maps since maps coming from d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016